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Abstract 

The use of Geographic Information Systems has increased considerably since the eighties and nineties. As one of their 
most demanding applications we can mention shortest paths search. Several studies about shortest path search show 
the feasibility of using graphs for this purpose. Dijkstra's algorithm is one of the classic shortest path search algorithms. 
This algorithm is not well suited for shortest path search in large graphs. This is the reason why various modifications 
to Dijkstra's algorithm have been proposed by several authors using heuristics to reduce the run time of shortest path 
search. One of the most used heuristic algorithms is the A* algorithm, the main goal is to reduce the run time by 
reducing the search space. This article proposes a modification of Dijkstra's shortest path search algorithm in reduced 
graphs. It shows that the cost of the path found in this work, is equal to the cost of the path found using Dijkstra's 
algorithm in the original graph. The results of finding the shortest path, applying the proposed algorithm, Dijkstra's 
algorithm and A* algorithm, are compared. This comparison shows that, by applying the approach proposed, it is 
possible to obtain the optimal path in a similar or even in less time than when using heuristic algorithms. 

Keywords: Shortest path search algorithm; Geographic Information Systems; Network analysis; Reduced graphs; 
Dijkstra's algorithm 



Introduction 

From a practical point of view, a Geographic Information 
System (GIS) is a computer system capable of handling 
georeferenced data. These kinds of data refer to informa- 
tion associated with geographic coordinates (longitude, 
latitude). A GIS should also facilitate the relationship 
between socio-economic data (i.e. population density) and 
geographic data, this can be achieved through the gener- 
ation of thematic maps (Jiang et al. 2010), a service for 
generating this kind of maps is described by (Rodriguez- 
Torres and Rodriguez-Puente 2010). The relevance of a 
GIS is closely related to the ability of building mod- 
els or representations coming from the real world. This 
kind of system is very important because it facilitates the 
decision-making process and has a high social impact. 
Among the most demanded features in GIS we can men- 
tion those related to the analysis of routes, some examples 
are as follows: 
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- What is the shortest path between places x and yl 

- What is the optimal path between places x and y 
considering a certain criterion? 

- What is the lowest cost path between x and y via 
places Xi,X2, ■ ■ • ,^«? 

Shortest path search has been widely studied. Many 
applications can be found in various branches of science, 
specifically in GIS. The road networks used by GIS to 
respond to the above requests are usually large and could 
have thousands of streets, that is why one should pay 
particular attention to how such information is processed. 

One of the classic and most used algorithms for calcu- 
lating the shortest path from an origin to a destination 
is Dijkstra's algorithm, it was first enunciated by Edsger 
Wybe Dijkstra (1959) and is one of the most used and 
discussed algorithms in the literature of graphs, the tem- 
poral complexity is 0(|£| + |K|/o^|V^|), where |£| is the 
number of edges and | V\ is the number of vertices of 
the graph. However, this algorithm is not efficient for 
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searching shortest path in large graphs (Fuhao and Jiping 
2009). 

Various modifications to Dijkstra's algorithm have been 

proposed by several authors. Some of these algorithms use 
heuristics to reduce the run time of shortest path search 
and we can classify them as follows: 

1. Without data preprocessing, i.e.: 

- A"* (A-star) algorithm (Hart et al. 1968). 
Improved Live long planing A* (Huang et al. 

2007). 

- Bidirectional search (Pohl 1969). 

- In (Nazari et al. 2008) an approach based on 
restrictions on the search space is proposed. 

2. With data preprocessing, i.e.: 

- Reach-Based Pruning (Gutman 2004). 

- Landmark-A* (Goldberg and Harrelson 2005; 
Goldberg and Werneck 2005). 

- Highway Hierarchies (Geisberger et al. 2008; 
Jagadeesh and Srikanthan 2008; Sanders and 
Schultes 2005; Song and Wang 2011; Wang 
et al. 2006). 

- Edge flags (Koehler et al. 2005; Mohring et al. 

2006) . 

- Geometric containers (Wagner and Willhalm 

2007) . 

- Precomputed Cluster Distances (PCD) (Maue 
et al. 2010). 

Delling et al. (2009) show an overview of routing 
algorithms; all approaches show important advances in 
shortest path search and make possible a low response 
time in large graphs using heuristics. 

One of the most used heuristic algorithms is the A* algo- 
rithm, the main goal is to reduce the run time by reducing 
the search space analyzing only the vertices that have bet- 
ter possibilities to appear in the shortest path. The results 
obtained by this algorithm depend on the heuristic func- 
tion used to determine the order in which vertices are 
visited. If the selected heuristic is optimal the compu- 
tational complexity is reduced to 0(«). That is why the 
A* algorithm is widely used for shortest path search. 

One approach studied for shortest path search on large 
graphs is related to the use of some properties of the 
road networks, mainly to reduce the search space of the 
shortest path. 

In the following paragraphs we will be referring to some 
relevant researches: 

- Gutman proposes an approach (Gutman 2004) in 
which he defines a formal attribute of vertex called 
reach, in order to measure vertex relevance. The 



reach attribute is precalculated using the graph to 
reduce the run time of shortest path search. 

- A relevant approach that uses a property of a road 
network is related to the hierarchy present in this kind 
of network. Many strategies use this approach, for 
example, Sanders and Schultes propose algorithms 
for constructing and querying highway hierarchies 
achieving a small run time and show the feasibility of 
this approach (Sanders and Schultes 2005). 

- Bast et al. define an approach based on relevant nodes 
(transit nodes) for long-distance travel (Bast et al. 
2007). It consists of making precalculations of 
shortest path between all pairs of transit nodes and 
from each potential source or destination to its access 
transit nodes. This approach needs an effective 
notion of "far away" and the optimal results are 
guaranteed depending on the local filter selected. 

- Gonzalez et al. use the hierarchy of roads for 
partitioning the network into areas and make 
precalculations of shortest path in these areas 
(Gonzalez et al. 2007). This approach uses the fact 
that some roads are more traveled than others and 
drivers usually use the largest roads. 

- Geisberger et al. propose an approach that uses only 
edges that are related with "important" nodes 
(Geisberger et al. 2008). Pfoser et al. present a 
shortest path algorithm that imitates human driving 
behavior by exploiting road network hierarchies 
(Pfoser et al. 2009). 

As an important characteristic of the approaches 
described above, it may be determined that they are based 
on the idea that for calculation of large paths (in large net- 
works), only high levels roads (highways, roads more tra- 
versed, etc.) of the hierarchical road network are needed. 
This consideration can reduce the run time of shortest 
path search algorithms, but can not guarantee to return 
the optimal path. 

Various commercial systems use heuristic algorithms 
with the aim of reducing the run time (Bast et al. 2007). 
Various authors have defined heuristics for achieving this 
goal (Fei et al. 2010; Liu and Yang 2009; Nazari et al. 
2008; Sun et al. 2008; Xu 2005). Fu et al. show a review 
of this kind of algorithms for shortest path search in 
transportation applications (Fu et al. 2006). 

Heuristic algorithms are relevant for shortest path 
search in large graphs, even when an error is introduced, 
acceptable in most of the situations, but they do not 
guarantee to obtain the optimal path in all cases. 

On the other hand, there are algorithms for reducing 
a graph (Liu et al. 2010; Lu and Liu 2007; Sadiq and 
Orlowska 2000). With the application of any algorithm 
on the reduced graph, obviously, a lower response time is 
achieved. However, in this case, reduction of data brings 
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loss of information. Thus, obtaining a path that is the 
optimal in the original graph can not be guaranteed. 

Rodriguez-Puente proposes a graph reduction algo- 
rithm without loss of information (Rodriguez-Puente 
2010). It specifies a mechanism to obtain the original 
graph from which the reduced graph was obtained. This 
algorithm can be applied naturally to a GIS because a map 
is usually divided into: zip code, states, regions, etc. This 
fragmentation of the map contribute to create a partition 
according to the algorithm requirements. This algorithm 
has a computational complexity 0{n'^), which is a high 
cost for a response in real-time environment. However, 
in the proposed approach we make a graph reduction for 
each graph, only once, and the execution of the reduction 
algorithm is done only for data preprocessing. Highlight- 
ing that it does not affect the run time of shortest path 
search. 

This article presents a modification of Dijkstra's short- 
est path search algorithm. It shows that it is possible to 
obtain the lowest cost path in all cases in a time similar 
to A* algorithm. Thus, the application of this algorithm 
in GIS can make improvements in services provided by 
this kind of systems. The use of the proposed algorithm 
integrated with the mentioned reduction algorithm will 
ensure efficiency in shortest path search, while maintain- 
ing accuracy. 

The paper is organized as follows: first, a brief descrip- 
tion of the graph reduction algorithm is provided. Second, 
the algorithm for finding shortest paths in reduced graphs 
is presented. Then, correctness of the algorithm is proved. 
Finally, some experimental results and conclusions are 
discussed. 

Graph reduction 

In order to achieve a better understanding of the pro- 
posal, certain definitions and notations related with graph 
theory must be introduced. Then, the selected graph 
reduction algorithm, used in the proposed approach, is 
presented. 

Definitions and notations 

Relevant definitions and notations related to the proposed 
approach are as follows: 

Definition 1, A graph is a pair G = (V,E), where: 

- V is a set of vertices. 

- £ is a set of edges. An edge is an unordered pair of 
vertices (v;, Vj) such that v/, v/ e V. 

Definition 2. A weighted graph is defined as a structure 
G= (V,£,/c), where: 

- y is a set of vertices. 

- £ is a set of edges. 



- The function : £ ^ R+ assigns to each edge a 
positive real value called cost. 

Definition 3. A graph rewrite rule R — (G,-, Gj, i/fj„, ifout) 
over a graph G = {V,E,fc) consists of: 

- a graph G, = ({v,}, 0), where e V. 

- a graph G; = (V/,£;). 

- two sets of embedding information ijfin, xjrout of the 
form {{Vm, ci, C2, v„)}, where: 

ci, C2 e M"*", Vm e V), v„ e {V — Vj]; in the case of 
V^m, ^(vn, Vi) e £, such that /c(v„, v/) = ci. After 
applying the rewrite rule, a new graph 
H = (Vi, E\,fc\) is obtained and it holds that 
3{v„, Vm) e El, such that fci{v„, v^) - c^. 
Analogously to ^in, we define V^oMt> with edges 
orientation as the only difference. 

El = E — EfLIEjU Ej(, (Vfi, Vt2) G Et if and only if 
(vti = Vi and Vti e V) or {Va e V and vq = v;). 
{Vm, v„) e Ei^ if and only if (v^, ci, C2) e (V^in U V^oMf). 
ci, C2 e M+./ci : £ ^ R+, 



fc(u, v) if (u, v) e Eandu, v ^ v; 
fq{u, v) if (m, v) e Ej 

ci if3iu,ci,C2,v) e iifi„ U V^owf) 



A graph rewrite rule also can be defined over an undi- 
rected graph, in this case, the sets Vf;„ and V^oMt must be 
represented as an only set called ^. 

The set of edges that join vertex v, with the vertices of 
the graph G — G, are called pre-embedding edges. After 
applying a rewrite rule, the edges that join a vertex of the 
graph Gj with a vertex of the graph G — Gj are called 
post-embedding edges. The function i/f;„ transforms the 
set of pre-embedding edges that are incident in a vertex 
Vi in post-embedding edges that are incident in one or 
more vertices Vj € Vj. Similarly, the function xlfout trans- 
forms pre-embedding outgoing edges from a vertex v; in 
one or more post-embedding outgoing edges from several 
vertices Vj e Vj. 

Definition 4. A reduced graph is a tuple G^ = 
(Vr,Er,f,R), where: 

- is a set of vertices. 

- £r is a set of edges. 

- f:VrxVrxVr^ (R+ [ji^, oo}), is a function that 
for each (v,-, Vj, v^) returns the cost of going from v/ to 
Vk through Vj, with v^ adjacent to Vj and Vj adjacent to 
V,-. Function f is obviously also defined for the cases 
where Vi = Vj and/or Vj = v^. In the trivial case, 
/(v,v,v) = 0. 

- R isa set of rewrite rules over (Vr, Er,fc), where fc is 
defined as/c(v, w) =f{v, v, w). 
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This definition is particularly important when it is asso- 
ciated with another graph, i.e., when a graph is reduced 
from another graph. We can state that a graph Gr = 
(Vr,Er,f, R) is reduced from a graph G = (V,E,fc), when 
applying the set of rewrite rules R to the graph Gr, the 
graph G is obtained. 

In the case of function / for all 3-tuple of vertices 
Vi, Vj, Vj( e Vr it holds that/(v;, Vj, Vk) = fdvi, Vj)+fc{vj, v^). 
Notice that f{vi, Vi, vj) = fdvi, vj). If and vj are not adja- 
cent, the image of both functions would be infinite. This 
is the way in which we specify that two vertices are not 
adjacent. 

Graph reduction algorithm 

The reduction algorithm enunciated in (Rodn'guez- 
Puente 2010) has as a key characteristic that it guaran- 
tees no loss of information through the incorporation of 
rewrite rules. However, an improved version is presented 
here, since it is necessary to differentiate between what are 
defined as internal and external vertices below. 

This algorithm has two variables as input: a reduced 
graph G = ( V,E,f,R) and a partition over the set of ver- 
tices of the graph. On the other hand, the algorithm has as 
output, a reduced graph. 

In first place, it is necessary to refine partition P in 
order to achieve an optimal path having the same cost 
of the optimal path obtained by Dijkstra's algorithm in 
the original graph; to do this, we introduce the following 
definition: 

Definition 5. Let a graph G = {V,E) and a partition P 
on V, a vertex Vi e is internal if Vvy e V, such that 
and Vj are adjacent, it holds that and vj are in the same 
class of P; i.e. [ Vi] = [vj] otherwise Vi is external. 

For refining P, we use the following strategy: 

- Two vertices Vi and Vj are in the same class of refined 
partition if, and only if: 

- Vi and Vj are in the same class in the original 
partition P. 

- Vi and Vj are internal vertices. 

- For each external vertex a new equivalence class is 
created as a singleton containing only this vertex. 

In Figure 1, we show an example of how to refine a 
partition using definitions of internal and external vertex. 

Next, we create a new vertex w,- for each A; e P, \Ai\ > 
1, i = 1..S. V' = Wi is a set of reduced vertices and V — V' 
is the set of unreduced vertices in the reduced graph. 

We add a vertex in the reduced graph for each class of 
the partition calculated in the previous step. If the car- 
dinality of the class is 1, the vertex is considered as an 
unreduced vertex; in any other case, it is considered as a 



Partition P 




Refined partition using internal and external vertices 




Figure 1 Example of partition refinement. 



reduced one {GetReducedVertices method). Next, a set of 
edges is calculated. One edge can be added to the reduced 
graph if the two vertices of the edge belong to different 
equivalence classes {GetEdges method). With the addition 
of edges to the reduced graph, the cost function fr of the 
reduced graph must be updated. 

The creation of the set of rewrite rules is an essen- 
tial step in the reduction algorithm. With the rewrite 
rules, the original graph can be obtained from the reduced 
graph. Therefore, rewrite rules guarantee no loss of infor- 
mation, and so the reduction process is reversible. 

According to Definition 3, a graph rewrite rule is a 
quadruple of the form (G;, G/, \j/in, i^out)- Then, we create 
a rewrite rule for each reduced vertex in V', where: 

- Gi = ({wi},cl)),Wi e r. 

- Gj = (Ai, Ei,fcj) is a subgraph of G = (V, E,fc, R), 
where exists an edge (u, v) e £; if and only if 

(u, v) e £ and u,v e Ai; in addition fcj(u, v) = fdu, v). 

- Vm is a set of quadruples of the form (v^, ci, C2, v„) 
such that for Vm e Ai and v„ e {V — Ai) and 
(v„,Vm) e £ and Vi) e Er it holds that 

Cl =fcj(Vn,Vmy, andc2 =fcj(Vn,Vi). 

- i^out is a set of quadruples of the form {Vm, ci, C2, v„) 
such that for € Aj and v^ & (V — Aj) and 

(Vfn, v„) e £ and (v,, v„) e Er it holds that 

Cl =fcj{Vm, Vn); and C2 =fcj(Vi, Vn). 

The previous explanation corresponds to the implemen- 
tation of GetRewriteRules method. 

Another step that contributes to obtain the optimal path 
is the calculation of function fy. Function fr stores the cost 
of the shortest path from one vertex to another, traversing 
a reduced one. 

Function fr is calculated, initially, {Updatefr method) for 
each reduced vertex. This step is made in this way: 
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- Create an auxiliary graph. First, this graph is equal to 
the graph Gj = {Vj, Ej,fcj) of the rewrite rule. Second, 
we add to this graph, vertices that are adjacent (in the 
original graph) to vertices of graph Gj (notice that 
these vertices are internal taking into account original 
graph and set Vj), and the edges that connect them. 

- We apply MDijkstra algorithm (see next section) 
using all pairs of related vertices, identified in the 
previous step, as origin and destination vertices. 

- The obtained costs and path are stored in/^. 

Additionally, for all 3-tuples of vertices Vi, vj, e 
V, where Vj is a non-reduced vertex, /r(v„ vj, v/^) = 
f(vi, Vj, Vk)- 

Path from vi to is also stored, with the goal of avoiding 
additional run time, when the shortest path search in a 
reduced graph is retrieved. 

Algorithm 1 provides the detailed pseudo-code of the 
graph reduction algorithm. 



Algorithm 1 GraphReduction 

Input: A reduced graph G = (V,E,f,R), where i? is a set 

of rewrite rules. A partition P on V. 
Output: A reduced graph Gr = (Vr, Er,fr, Rr)- 

P = GetPartition{V,RE) {P = V/RE = 

[Ai,A2, . . .,As}, where A,- =[fl,] , ai & V,i= l..s} 

GetReducedVertices(P) 

GetEdges(P) 

GetRewriteRules(P, G) 

Create the reduced graph Gr = {Vr,Er,fr,Rr) 
foralM; eP, \Ai\ > Ido 

Updatefr{G,Rri.Gi,Rri.Gj,fr) {Rri is the rewrite rule 

associated with the class At} 
end for 
return Gr 



The complexity of the reduction algorithm would be 
determined by steps 6-8. According to the above descrip- 
tion of Updatefr, this method calculates shortest path 
from all external vertices (taking into account the original 
graph) of Vj to all vertices of the auxiliary graph. 

In a graph obtained from a network in a map, a ver- 
tex represents the intersection of two or more lines and 
an edge represents the connection between two intersec- 
tions. That is why, in this land of graph, there are no 
edges that intersect among them. Thus, we can assume 
that graphs representing the modeled network through a 
map are planar. 

Moreover, in a graph with these characteristics, the 
degree of a vertex is generally equal to 4, except in a few 
cases. Thus it is assumed, without loss of generalization, 
that the degree of a graph that represents a network of 
this type is less than or equal to 10. Let ACG"*") the degree 



of G, the auxiliary graph has, at most, a ■ A(G"'") vertices. 
In Updatefr method, MDijkstra algorithm is called for 
each adjacent vertex to any vertex of Vj (see Shortest path 
search algorithm section for temporal complexity of this 
algorithm), so the temporal complexity, in the worst case, 
is: 0{a ■ A(G+) • a ■ A(G+) log(fl • A(G+))) = 0(A(G+)2 • 
fl2 . log(fl) + log{MG+))) 

The terms involving A(G+) are constant, so the tempo- 
ral complexity is 0{a^ ■ log(fl)). 

As a conclusion, the temporal complexity of Algo- 
rithm 1 is of polynomial order. The reduction process is 
made only once, as data preprocessing. This preprocess- 
ing task causes an increased in the spatial complexity but, 
with this approach, we can obtain lower run time in every 
shortest path computation over the reduced graph. 

Reduction example 

In this section we explain a very simple example to show 
the reduction process. 
Let: 

- G the graph of Figure 2(a). 

- P = ({vi, V2, V3}, {V4}, {V5}, {ve}, {vy}) a partition over 
the set of vertices of G. 

In first place, we create the reduced vertices, one 
per each equivalence class of P. Thus, after this step, 
Gr = ({Vri, V4, V6, V5, V7}, {},{}, {}). Notice that Vr = 

{Vrl, V4, V6, V5, Vy}, Er = {},fr = {], Rr = {]■ 

Then, we need to calculate the edges of Gr as is speci- 
fied in the description of Algorithm 1. If there is an edge 
between two vertices of G, and these vertices are unre- 
duced in Gr, this edge is added to the reduced graph; for 
example the edge (V5, vy) in G is added to Gr- Addition- 
ally, if there is a vertex v e Pi in a class of P(v e Vr), 
and there exists an edge from v to other vertex u of G 
(u is unreduced vertex in Gr), the edge from the reduced 
vertex, that represents the class Pi of P, to the vertex u is 
added to Gr; for example the edge (V2, V4) in G is added to 
Gr as the edge (v^i, V4), V2 is in the class of P represented 
by Vrl- 

Therefore, the graph of Figure 2(b) is obtained. In addi- 
tion, the rewrite rules are created. The graph G, of the 
rewrite rule is G/ = ({Vri}, {}) (see left of Figure 3), the 
graph Gj is created with the vertices of the class of Pi, 
represented by Vri, and edges among them on G, as is 
presented on the right side of Figure 3. Once we created 
graphs Gi and Gj, the embedding information (i/fj„ and 
i^out) must be specified, as is described in the specification 
of the reduction algorithm. 

Finally, the function fr is calculated. In the example of 
the reduced graph of Figure 2(b), we need to store the path 
from V5 to V4 and the path from V5 to ve, both through v^i. 
In this case,/r(v5, v^i, V4) = 6,fr(v5, Vri, ve) = 9. 
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The application of the rewrite rules obtained (Figure 3) 
to Gr (Figure 2(b)) allows us to obtain the original graph G 
(Figure 2(a)). For this purpose, we enunciated Algorithm 2 
based on Definition 3. 

This algorithm has as input a reduced graph and a 
rewrite rule. If a reduced graph has more than one 
reduced vertex, the application of this algorithm for each 
reduced vertex would be sufficient to obtain the original 
graph. 



- The pre-embedding edge (vri, ve) of cost 3 is 
transformed in post-embedding edge (vs, vs) of cost 3. 

- The pre-embedding edge (vri, vs) of cost 1 is 
transformed in post-embedding edge (v^, V5) of cost 4. 

- The vertex v^i is eliminated from G3. 

After applying the rewrite rule we have obtained the 
graph G (Figure 2(a)). Thus, in the reduction process does 
not exist loss of information, that is, the reduction is 
reversible. 



Algorithm 2 Graph Rewrite Rule Application 

Input: A reduced graph Gr = (Vr,Er,fr, Rr){Rri = 

Gi, Gj, \j/in, xj/gUt is a rewrite rule in Rr associated to a 

reduced vertex of Gr). 
Output: A reduced graph G = (V, E,f, R). 

for all e e £, do 

AddEdge(Gr, e) {Add edge e to graph Gr) 

end for 

for all ci, C2, U2) e V^,>, do 

AddEdge{Gr, {U2, mi, costo = C2)) {Add an edge from 

U2 to Ml of cost C2} 
end for 

for all (Ml, ci, C2, M2) e ifgut do 

AddEdge{Gr, (mi, M2, costo = C2)) {Add an edge from 

Ml to M2 of cost C2] 
end for 

DeleteVertex(Gr, Vr) {Delete vertex Vr from Gr} 
return Gr 



Following, we show an example of application of the 
rewrite rule of Figure 3, using Algorithm 2: 

- Add to Gr (Figure 2(b)) the graph Gj of the rewrite 
rule {Gj is the right side graph of the rewrite rule). 

- The pre-embedding edge (vri, vs) of cost 1 is 
transformed in post-embedding edge (vi, V5) of cost 1. 

- The pre-embedding edge (v^i, Vi) of cost 2 is 
transformed in post-embedding edge (v2, V4) of cost 2. 



Shortest path search algorithm 

In this section, a modification of Dijkstra's shortest 
path search algorithm is shown. The goal of the pro- 
posal is to obtain an optimal path with the same 
cost as the path returned by Dijkstra's algorithm, for 
the same origin and destination, but using a reduced 
graph. 

Both, Dijkstra's algorithm and the one proposed, are 
based on iterations over the set of vertices. At each itera- 
tion, the algorithm will find a vertex so that the distance 
from the origin vertex to the selected vertex is minimal. 




={(Vi,l,l,V5)} 

4'out=-t(V2,2,2,V4),(V3,3,3,V6),(V3,4,4,V5)} 

Figure 3 Rewrite rule example. On the left side is the graph 
Gj = ({i/fi ), {)), on the right side is the graph 
Gj = {[V],V2, v-j], {(vi, 1/2), (V2, v-\), (V2, V3)]) and on the bottom is the 
embedding information ^/fout- 
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This vertex is called pivot. Usually, the vertices are stored 
in a priority queue considering, as priority, the distance 
from the origin vertex. This data structure is used to facil- 
itate the selection of the pivot. Besides, two vectors are 
updated during the execution of the algorithm. One of 
them (vector D) is updated with the lowest distance from 
the origin vertex to each vertex Vi (we refer to this distance 
as D[ Vi]). The other one (vector Pr) is updated with the 
predecessor of each vertex in the shortest path from the 
origin vertex. 

Every time that a pivot w„ is selected, the distances to 
its adjacent vertices are updated. If the distance from the 
origin vertex to the pivot (_D[tv„]) plus the distance from 
the pivot to vertex is lower than the distance from the 
origin vertex to v, {D[ v/]), D[ v/] is updated. 

Additionally, there are two differences between 
Dijkstra's algorithm and the proposed one. 

In the first place, a cost function / : V x V x V ^ 
(/?"*" U {0, oo}) is used for calculating the cost from one 
unreduced vertex to another one, traversing a reduced 
vertex. Notice that, traditionally, the cost function of a 
graph has the cost of an edge. 

The other difference in the proposed algorithm, is 
related to the actualization of distances to a reduced ver- 
tex. Let us consider an unreduced vertex w„ as pivot, it is 
necessary to update the distances to all adjacent vertices 
as described above. If a reduced vertex Vr is adjacent to 
the pivot, we have to update the distances to all vertices 
that are adjacent to Vr (see lines 15-22 of Algorithm 3) 
using the cost function / for guaranteeing the optimal 
result. 

When analyzing the temporal complexity of the 
proposed algorithm, there are two differences with 
respect to Dijkstra's algorithm. The first one is the 
use of function / this function is calculated at pre- 
processing time, so it does not affect the temporal 
complexity. 

The second one implies the execution of one cycle. 
However, it should be noted that this cycle is repeated 
A(G~^) (constant, A(G"'") < 10) times for each vertex that 
is stored in the queue. 

Thus, A(G"'") < log(IV^I) for large graphs, this new 
cycle does not affect the temporal complexity. Con- 
cluding, temporal complexity of Dijkstra and MDijkstra 
algorithms are the same order. Also notice that, in 
a planar graph, we can establish a linear relation 
between vertices and edges. From the Euler's formula 
(Diestel 2010), it follows that |£| < 3|\^| - 6 if |K| > 
3. So, in the case of Dijkstra's algorithm in planar 
graphs, we can state that the temporal complexity is 
0(|£| + \V\log(W\)) = 0(|\/|log(|V|)). 

For applying the proposed approach, we need to reduce 
a graph only once. Then, we can make several shortest 
path search computations. In other words, we propose to 



make a data preprocessing for achieving a performance 
improvement in shortest path search. 

This approach brings us the benefit of performing short- 
est path search in graphs with less vertices than other 
algorithms use, for instance, Dijkstra and A*. Therefore, 
it is logical for the proposal to achieve a lower run time. 
Nevertheless, it is necessary to demonstrate, that the 
path obtained by this proposal is optimal and equal (in 
terms of cost) to the one obtained by Dijkstra's algo- 
rithm. These demonstrations are shown in the following 
section. 

The detailed pseudo-code of the proposed modification 

is presented in Algorithm 3. 

Table 1 shows a comparison of temporal complexity 
of Dijkstra, A* and MDijsktra algorithms. When analyz- 
ing A* algorithm considering optimal heuristics, it can be 
stated that its temporal complexity is 0(«), where n is the 
number of vertices of the graph. Besides, the temporal 
complexity of Algorithm 3 (MDijkstra) is 0(«i log(«i)) < 
0(«j), where «i is the number of vertices of the reduced 
graph. Thus, if in the reduction process we obtain a graph 
G = (Vr,Er), such that mi = \Vr\ = ^/n, the temporal 
complexity of both algorithms must be similar. 



Algorithm 3 mDijkstra 

Input: A reduced and weighted graph G = {V,E,f,R) 

and an origin vertex v origin- 
Output: A vector D of minimum distances, a vector P of 



predecessors. 
1: C„ = {}, queue = PriorityQueueO 
2: for all V e 1^ do 

3: ID^ [ v] —fiyorigint ^origint ^\ «[ ^] — ^origin 

4: queue.add(D„ [ v] , v) 
5: end for 

6: while not queue.emptyi) do 

7: Wn = queue.popQ {The vertex w„ is the pivot} 

8: C„ = C„ U {w„} 

9: for all V e adjacents(w„) do 

10: if D„[v]> Dn[ Wn] +f{Wn, Wn, v) then 
11: Dn[v] = Dn[Wn] +f(Wn, Wn, v) 

12: Pr„ [v] = w„ 

13: queue.decreaseKey{D„ [ v] , v) 

14: end if 

15: if V is a reduced vertex then 
16: for all s e adjacent(Gr, v) do 

17: if Dn[s]> Dn[ Wn] +fr(Wn, V, S) then 

18: Dn\s]=Dn\Wn\-\-fr (Wn , V, s) 

19: P„[s] = V 

20: end if 

21: end for 

22: end if 
23: end for 
24: end while 
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Table 1 Temporal and spatial complexity of Dijkstra, A* and MDIjkstra algorithms 



Algorithm 


Temporal complexity 


Temporal complexity (using Heap data structure) 


Spatial complexity 


Dijkstra 


0(|f| + |\/|2) 


0(|l/| + log(|\/|)) 


0(\E\ + \V\) 


A* 


0(\V\), if the selected heuristic is optimal 


Oi\V\), if the selected heuristic is optimal 


om + \v\) 


M Dijkstra 


0(\E\ + \V\2) 


0(|l/| + log(|\/|)) 


0(|f| + |\/|) + |/?| 



However, as is impractical to obtain an optimal heuris- 
tics for this purpose, we can state that the proposal 
obtains a response in a lower run time than Dijkstra and 
A* algorithm if a condition «i = \ Vr\ < ■\/in) is satisfied. 

Generally, there is a trade off between efficiency and 
accuracy in algorithms that have large amount of data 
as input. The main result of the present work is the 
efficiency improvement of shortest path search in large 
graphs without affecting accuracy. 

We have the possibility to make a shortest path 
search in the reduced graph between any pair of ver- 
tices of the original graph. It can be achieved by apply- 
ing a rewrite rule to a proper reduced vertex. How- 
ever, this involves an additional cost to shortest path 
search. 

It is hard to state that an algorithm for shortest path 
search is better than other in all cases. In this case, our 
proposal need a higher space, associated to a prepro- 
cessing stage to calculate function / (see Definition 4), 
than classical Dijkstra's and A* algorithms (nevertheless, it 
should be highlighted that the preprocessing is made only 
once, but shortest path searches are made several times). 
However, MDijkstra algorithm gives a response in a lower 
run time. 

Below, we prove the correctness of MDijkstra algo- 
rithm, with the aim of establishing that the proposed 
algorithm obtains an optimal path, and the cost of 
this path is the same as the cost of the path obtained 
by Dijkstra's algorithm. Next, we state a theoretical 
measure to ensure that the response time is lower 
than A* algorithm. This is the algorithm selected in 
the literature of shortest path search, to compare run 
times. 

Correctness proof 

In this paper, a new shortest path search algorithm is pro- 
posed. Therefore, it is necessary to prove that the path 
obtained by the proposal is optimal in all cases. 

With the aim of facilitating the understanding of this 
section, the correctness proof of several lemmas is pre- 
sented in Appendix A. 

By Lemma 3, Dpj-i(v) has the minimum distance from 
vertex Vo to vertex v. 

To prove the correctness of Algorithm 3, we shall prove 
that for any path Ca = (Vo, vi, V2, v^) with distance 



vector Dc and predecessors vector P, it holds that Vv e 
V,D]s[-i(v) < DcM-i(v), where v is an unreduced vertex. 

Theorem 1. V« e {l,2,..,Af - l}[Ca(0) = 0 ^ 
Vw < n + l(Ca(m) < N) ^ Wm < n + 
l[Dc{Ca(m)) + fc(Ca(m), Ca(m + 1)) = Dc{Ca{m + 
1))] DN-\{Ca{n)) < DciCain))] 

Proof. (By induction on «) 

Base case n = Q immediate by Lemma 2, 

For n = k+\: 

By Lemma 1 in step AT — 1 all vertices have been visited. 

DN-i(Ca(k + 1)) < DN-i(Ca(k - 1)) + f(Ca(k - 1), 
Caik), Caik + 1)) 

(1) 

The distance to a vertex Vi is less than or equal to the 
distance to a visited vertex Vj plus the distance from vj to 
Vi, by Lemma 5 

By induction hypothesis, Z)Ar_i(Cfl(A^—l)) < Dc{Ca{k— 
1)), replacing Dtq-i{Ca(k — 1)) in (1), 

DM-iiCik + 1)) < Dc(Ca{k - 1)) +f{Ca{k - 1), 
Ca{k), Ca(k + 1)) 

(2) 



Note that Ca{k) = PcM+i(Ca(k + 1)) and Ca(k - 1) = 
PcM-\{PcM-\{Ca{k + 1))), replacing Ca{k) y Ca{k — 1) in 
(2), 

DM-i{Ca{k+l)) < Dc(PcN-i(PcN~i(Ca(k + 1)))) 
+ fiPcN-i (PcN-i(Caik + 1))), PcN-i(Caik + 1)), 
Caik + 1)) 

(3) 

By Lemma 3, Dc(Ca(k+ 1)) = Dc(Pcn-i (Pcn-i (Caik+ 
1)))) + f(PcN-i(PcN~i(Ca(k + m,PcN-i(Ca(k + 
l)),Ca{k + 1)), replacing in (3), DM-i{Ca{k + 1)) < 
DciCaik + 1)). □ 

We can prove the correctness of Dijkstra's algorithm 
with a similar reasoning because the same invariants 
are satisfied. Thus, for the next proof we assume that 
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Dijkstra's algorithm is correct and satisfies invariants anal- 
ogous to those defined for Algorithm 3. 

As demonstrated before, Algorithm 3 returns the short- 
est path in the reduced graph. However, it remains to 
prove that the cost of the shortest path obtained by the 
proposed algorithm and the one obtained by Dijkstra's 
algorithm (in the original graph without reducing it) are 
the same. 

Let: 

- G= (V,£,/c)agraph. 

- Gr = (Vr, Er,f) a reduced graph obtained from the 
graph G. 

Theorem 2. Let Ca = (vi,..., v„) be a path of cost c 
obtained by applying Dijkstra's algorithm on the graph G, 
where Vi and v„ are unreduced vertices on the graph Gr, 
then 3Ca' = (ui, U2, ■ ■ ., ut) with cost c, ui = vi, ut = v„, 
such that Ca' is an optimal path on Gr- 

Proof. From Ca we can build a path Ca' of cost c on the 
graph Gr as follows: 

- Substitute each sub-path v,-, Vi+i, . . ., Vi+m for a path 
Vi, Vk, Vi+m where: 

- Vi+j &[vi],i=l..m 

- Vi, Vi+m are external vertices. The other 
vertices are internal 

- Vk is the reduced vertex (in the graph Gr) that 
represents the equivalence class [ v/] 

The cost of the path v/, v^, Vi+m is equal to the cost of the 
path Vi, Vi+i, Vi+m, by definition of function / Thus, 
the paths Ca and Ca' have the same cost. 

Suppose that exists a path Cb' = {u\, U2,..., Up) of cost 
ci < c in the graph Gr, where e Vr, i = l..p. Then we 
can obtain a path Cb of cost ci on the graph G as follows: 

- Substitute each sub-path m,_i, m,-, by a path 
Ui-i, Uj, Uj+i, Uj+m> ■ ■ ■> Ui+i of cost C3 where: 

- Ui-i, M,+i are unreduced vertices 

- Uj+t €[ui],j=l..m 

- C3 =f(Ui-U Ui, Ui+l) 

Therefore paths Cb and Cb' have the same cost (ci), this 
leads a contradiction. Thus, there is no path that has less 
cost than Ca. □ 

Corollary 1. Let Ca = (vi, v„) a path obtained by 
applying Dijkstra's algorithm on graph G, V/ e {1, 2, n] 
such that Ca[i] is an unreduced vertex in Gr, it holds that 
the distance to Ca[i] is equal to the distance obtained 
by MDijkstra algorithm on the reduced graph from v\ to 
Ca[ i]. 



Theorem 2 establishes that the cost of the shortest path 
from a vertex v, to any vertex Vj (v; and vj being unre- 
duced vertices in Gr) obtained by applying Algorithm 3 
is the same as the cost of the shortest path calculated 
by Dijkstra's algorithm in the original graph (without 
reduction). 

The fact that both source and destination must be unre- 
duced vertices could be a limiting factor (in terms of the 
number of vertices to which one can calculate the short- 
est path) if one does not have a mechanism that allows 
obtaining a reduced graph G„ from Gr where Vi e V (v, is 
a vertex in the original graph G = (V^, £)) is an unreduced 
vertex on G„. This can be accomplished by one or more 
expansions applying rewrite rules to the reduced vertex 
that contains vertex v;. 

Experimental results 

The comparison of the results of shortest path search, 
applying Algorithm 3 (MDijkstra), Dijkstra's algorithm 
and A* algorithm, provides elements emphasizing the 
advantages of the proposed approach. Besides, correct- 
ness proof of the proposed shortest path search algorithm 
is made. 

Algorithm 3 was coded in Python, using the NetworkX 
library (Hagberg et al. 2008). This library provides an 
implementation of Dijkstra's and A* algorithms, allowing 
to compare the three algorithms on the same technol- 
ogy and with efficient data structures. NetworkX uses 
a priority queue, implemented with a Heap, to find the 
shortest path using Dijkstra and A* algorithms. With this 
implementation, the complexity is 0(\E\ + \ V\ log(| V\)). 

It is well-known that there are several techniques to 
make performance improvement on shortest path search, 
based on Dijkstra's and A* algorithms; Zeng and Church 
compare some of them (Zeng and Church 2009). This 
performance improvement depends on several things, for 
example: programming language, data structures used in 
the implementation of algorithms, among others. There- 
fore, in order to be impartial with the proposal, we 
compare the proposed algorithm only with the implemen- 
tation of Dijkstra's and A'* algorithms in the NetworkX 
library. 

The algorithms were run on a Pentium 4 (3.2 GHz) with 
1.5 Gb of RAM and the Kubuntu 11.10 operating system. 

Two graphs were used for experimental test: one was 
obtained from a cartography of the North Carolina 
State^ and the other represents the road network of San 
Francisco''. The first graph, obtained from North Carolina 
cartography, has 41810 vertices. This graph was reduced 
twice. First, we arbitrarily construct two sets of polygons 
using zip codes. The first one has 30 polygons. The sec- 
ond one has 5 polygons (the second set of polygons does 
not depend of the first one). Obviously in the second 
case polygons are larger. In both reductions we use the 
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equivalence relation "in". If two points are into the same 
polygon, then they are related through relation "in". We 
obtain a reduced graph of 1826 using the first set of poly- 
gons, and a reduced graph of 250 vertices using the second 
set. 

The second graph, obtained from San Francisco cartog- 
raphy, has 149756 vertices and it was also reduced twice, 
using the equivalence relation defined above and two new 
arbitrary sets of polygons. The first set has 10 polygons 
and the second one has 4 polygons. In the first reduction, 
using the first set of polygons we obtain a reduced graph of 
2617 vertices. Using the second set of polygons, we obtain 
another reduced graph of 769 vertices. 

Dijkstra's and A* algorithms were executed on the orig- 
inal graphs and the proposed algorithm was applied to 
the reduced ones. Each algorithm was executed 10 times; 
the highest and lowest values were discarded. Finally, the 
average time among the remaining 8 values are shown. 

Table 2 shows a comparison among the three selected 
algorithms based on the run time of shortest path 
search. 

Discussion 

The results shown in Table 2 confirm the fact that, for 
large graphs, the run time of shortest path search with the 
proposed approach would be smaller than the run time 
obtained with classical approaches. 

If in the reduction process we obtain a graph G = 
{Vr,Er), such that «i = \Vr\ = ^Jn, the temporal com- 
plexity of both algorithms (Dijkstra and MDijkstra) must 
be similar. However, as is impractical to obtain an opti- 
mal heuristics for this purpose, we can state that the 
proposal obtains a response in a lower run time than 
Dijkstra's and A* algorithm if a condition «i = \Vr\ < 

Table 2 Time of shortest path search with Dijl^stra's and A* 
algorithms in two original graphs (Gi , Gj) and time of 
shortest path search in four reduced graphs with the 
proposed approach 



Graph Algorithm NV^ Time (seconds) Optimal path 



6i 


Dijkstra 




0.6160 


yes 




A* (h=0) 


41810 


0.4938 


yes 




A* (h=Euclidean distance) 




0.0200 


no 


Gfi.i 


Algorithm 3 (MDijl<stra) 


250 


0.0036 


yes 


Grl.2 


Algorithm 3 (MDijkstra) 


1826 


0.0265 


yes 


62 


Dijkstra 




3.0249 


yes 




A* (h=0) 


149756 


2.2108 


yes 




A* (h=Euclidean distance) 




0.1011 


no 


Gr2.1 


Algorithm 3 (IVlDljkstra) 


765 


0.0193 


yes 


6r2.2 


Algorithm 3 (MDijkstra) 


2617 


0.0722 


yes 



'Number of vertices of the graph. 



.Jn is satisfied. Thus, if we assume that we have suffi- 
cient memory for storing reduced graphs, the proposed 
approach is better than Dijkstra's and A* algorithms; tak- 
ing into account that if we reduce original graph as pro- 
posed before, always we can obtain a response in a lower 
runtime. The proposal is not useful when the available 
memory is low and does not permit to store reduced 
graphs. 

In the case of the run time of Algorithm 3 (MDijkstra) 

on the graph Gri.2. the obtained time is higher than the 
one obtained by A* algorithm. The reason of this result is 
that the graph Gri.2 has a number of vertices considerably 
higher than the square root of the number of vertices of 
Gi. Notice that we state that the number of vertices of the 
reduced graph must be less than or equal to the square 
root of the number of vertices of the original graph. In 
the case of the graph Gr2.2 a lower run time than the one 
obtained by A* algorithm is achieved, although the num- 
ber of vertices is higher than the square root of the number 
of vertices of G2. 

The selection of origin and destination of the shortest 
path search in a GIS is usually made using a map, i.e. a 
user selects these points by clicking in the map shown by 
the GIS. We believe that, at any time that a user selects an 
origin or a destination point, the GIS can make an expan- 
sion of the reduced graph, using the extent of the map 
that is visualized and the selected point. If a system for 
shortest path search is implemented in this way, the time 
needed to expand a reduced vertex would be irrelevant 
for the shortest path search, considering that the temporal 
complexity of expanding a reduced vertex is 0{a), where 
a = max{\Ai\,Ai e P}. 

Most algorithms developed lately for shortest path 
search make efficiency improvement by reducing the 
search space, these approaches cause loss in accuracy. The 
presented approach makes use of a graph reduction algo- 
rithm without loss of information, in order to obtain a 
better run time of the search. This approach maintains the 
accuracy because the reduction algorithm guarantees no 
loss of data (see Table 1). 

Generally, heuristic algorithms are developed in order 
to reduce the run time of a specific algorithm, which 
solves some problems whose optimal solution involves 
a high computational cost. Many heuristic algorithms 
are developed for shortest path search in GIS, with the 
assumption that a low bound of error is admissible in this 
area. However, with the proposed approach, it is possi- 
ble to obtain the optimal path in a similar time, and even 
in less time, than with heuristic algorithms, as shown in 
Table 2. 

Conclusions 

In this paper, an algorithm for shortest path search 
on reduced graphs is developed. Experimental results 
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show that the proposed algorithm is more efficient than 
Dijkstra's algorithm on large graphs. In addition, we can 
conclude the following: 

- The proposed approach is particularly applicable to 
GIS, due to the way in which users perform a shortest 
path search in this kind of systems. This allows us to 
expand vertices avoiding the influence of the time 
used in this operation on the shortest path search. 

- The use of reduced graphs significantly reduces the 
response time in the shortest path search. That is one 
of the two main approaches used in literature to 
reduce the computational cost of this operation. 

- The shortest path search on a reduced graph ensures 
scalability regarding the size of the graph on which 
the analysis is performed. 

- We prove that the proposed algorithm allows us to 
obtain an optimal path in a reduced graph. The cost 
of the obtained path is equal to the cost of the path 
found using Dijkstra's algorithm on the original graph. 

- We have developed a method capable of performing 
shortest path search in a run time similar to A* 
algorithm (with h=0 and h=Euclidean distance). 



Future work 

The modifications made on Dijkstra's algorithm are 
related to the use of a new function that has the cost of 
going through a reduced vertex. Therefore, we can modify 
other algorithms to make shortest path search in reduced 
graph (like A* algorithm), whenever the cost of going 
through a reduced vertex is considered as the cost of the 
path. 

Appendix 

A Demonstration of cycle invariants of Algorithm 3 

Preconditions that must be met to prove the correctness 
of Algorithm 3 are expressed by the following definitions 
and notations: 

- G = (V,E,fc) is a weighted graph. Without loss of 
generality we assume that V = {0, 1, ...,M — 1} to 
make demonstrations less complex. 

- Gr = (Vn Er,f, R) is a reduced graph from G and the 
equivalence relation RE. Without loss of generality 
we assume that Vr = {0, 1, N — 1}. It is important 
to notice that in each path of a reduced graph, 
between two reduced vertices there are, at least, two 
unreduced vertices, as is shown in Figure 4. 

- Vk < AT, in the execution of Algorithm 3 we define: 

- A vertex w„, the vertex selected in step n. 

- A set Cn C Vr, the set of vertices visited in 
step n. Co = {vo], C„+i = C„ \J{wn}. 




Figure 4 Reduced graph example. Vertices 1,2,3 and 4 are reduced 
vertices, tiie rest are unreduced ones. 



- Dyi represents the minimum distance from Vg 
to each vertex v e as far as it is known in 
step n. Dq{vo) = 0, 

D„+i(v) = Min{Dn{Wn) +fciw„,v},Dn(v)) = 

Min{Dn(Pn(Wn)) +f(Pn(Wn), Wn, v),Dn(v)). 

- Pn Store, for each vertex, the predecessor in 
the shortest path from Vg to v^, as far as it is 
known in step n. Pq{Vo) = Vg, 



Pn+iiy) 



Wn iiDn(v) > Dn(w„) +fc{Wn> v) 

Pyiiv) in Other case 



For the correctness proof it is necessary to demonstrate 
that the following cycle invariants are held: 
V« < N: 

1. I C„ I = « + 1. In the iteration n, there are « + 1 
visited vertices. 

2. D„{Vo) = 0 A Pn(Vo) = Vo- The distance from origin 
vertex to itself is 0 at any iteration. The predecessor 
of the origin vertex is the vertex itself. 

3. Dniv) = Dn(Pn(Pn(m +f(Pn(Pn(v)),Pn(v), v). The 

distance to a vertex depends on the distance to its 
predecessor in the shortest path. 

4. Vv e Cn Dn+i{v) = D„(v). The distance to a vertex 
in the step n is the same that the distance in the step 
n + 1, for all visited vertices. 

5. Wvi, vj e V[ vj e C„+i D„+i{vi) < 
Dn+i{Pnivj)) + f(,Pn{vj), Vj, V;)]- The distance to any 
vertex is less than or equal to the distance to a 
visited vertex Vj plus the distance from Vj to Vj. 

Lemma 1. V« < AT, |C„| = « + 1 
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Proof. (By induction on n) 

From the definition of the algorithm, at each step a ver- 
tex w is visited, in step 0 vertex Vo is visited, thus in the 
base case we have Co = {vo}: |Co| = 1, 

For n = k+1, Q+i = Q [J{v}, being v the visited vertex 
in step ^ + 1, therefore I Q+i I = |Q| + |{v}| =k + 2. □ 

Lemma 2. V« < N, D„{vo) = 0 A P„iVo) = Vg 

Proof. First, we visit vertex Vg and update DniVg) = 
0, i.e., the minimum distance from Vg to itself is 
0, the function D„ has its domain in M+|J{0, oo}, 
so the smallest possible value that can be achieved 

is 0; 

Let cost = D„(P„(w„)) + f(P„(w„), w„, v), Vw„, v €V,it 
holds that 0 < 0 + cost, because the image of the function 
/ is R"*" U {0, oo} and the vector Z)(Vr) is initialized from / 

The condition D„(Vo) > D„(Wn) + fiPn(Wn),w„,Vo) 
is never satisfied, thus D„[Vo] and Pn[Vo] never 
change. □ 

Lemma 3. Vn < N, D„iv) = D„iP„iP„iv))) + 
f{P„{P„{v)),P„{v), V) 

Proof. (By induction on n) 
The base case « = 0, Vv e Vr,Doiv) = fciVo,v), by 
preconditions. 

f{vo, Vo, V) = fcivo, Vo) + /c(Vo, v) =/c(V(„ v), by definition 
of f and fc, replacing/ by fc. 

£)o(v) = 0 + /(v<„v<„ v)£)o(v) = £)o(v„) + f{Vo,Vo,v\ by 
Lemma 2 

£)o(v) = DoC-PoCvo)) + /(/'o(Vo), Vo, V), by Lemma 2 
For n = k+1: 

Choose Wk+i e V\Ci( such that Z),t+i(wAr+i) is minimal, 
Ck+i = Ck[J{wi,+i}. 

Case 1: If D^+i(v) > Dk+iiPk+i{Wk+i)) + f{Pk+i 
(wk+i),Wk+i,v), then Dk+i(v) = D^+iiPk+i (w^^+i)) + 

f(Pk+\{Wk+\), Wk+\, V) A Pk+liv) = Wk+l 

Case 2: If case 1 is not satisfied, Dk+i{v) = 
Dkiv),Pk+iiv) = Pkiv), Dk+iiv) = DkiPkiPkiv))) + 
f(Pk(Pk(v)),Pk(v),v), by induction hypothesis, replac- 
ing Pkiv) by Pk+liv) Dk+iiv) = DkiPk+iiPk+iiy))) + 
fiPk+iiPk+iiv)),Pk+iiv),v)- □ 

Lemma 4. Vv e C„ = i)„(v) 

Proof. Let v € C«, e F \ C„ 
w„ G C„+i by definition. 

Dniv) < D„(w„), otherwise vertex w„ was visited before 

vertex v, 

Dniv) < D„iWn) + fciw„,v) = D„{Pn{w„)) + 
f{Pn{W„),W„,v), 

Dn+1 (v) = D„ (v), by definition of D„+i (v). □ 



Lemma 5. Vn < N, Vvj, vj e K[ vj e C„+i ^ 
I^B+iCvO < D„+iiP„ivj)) +fiP„ivj), Vj, Vi)] 

Proof. (By induction on n) 
The base case « = 0, Co = {v/},Z)o(V;) = 0, by definition, 
notice that Vj is the only vertex in Cq (in the base case, if 

Vj e Co, Vj is the origin vertex). 
P„(vj) = Vj by Lemma 2. 

Vvi e V,fivj,Vj,Vi) = /(Po(v;).i'/. 1^0. by definition of/ 
andDo(i'i) = f(vj, Vj, vt). ThusDo(vi) < 0+f(Po(vj), Vj, vi), 
Do(vj) = 0 = DoiPoiVj)) DoiVi) < DoiPoivj)) + 
fiPo(vj),Vj,Vi) 

Di(vi) < Do(vi), from the definition (D„+i(v) = 
MiniD„iw„) + fciwn, v),D„iv))) and i)i(vy) = Doivj) = 0 
(notice that Vj is the origin vertex). Replacing Dq by Di: 
Di(vi) < DiiPoivj)) +fiPoivj),Vj,Vi) 
For n = k+1: 

Case 1: Vj e Ck Dk+iivi) < Dkivi), by definition 
Dk+iivi) < DkiPkivj)) + f(Pk(vj),Vj,Vi), by induction 
hypothesis D^+i(V() < Dk+iiPkivj)) +fiPkivj)>Vj>Vi) by 
Lemma 4 

Case 2: Vj = Wk. 

Dk+iivi) < DkiPkiWk)) +fiPkiWk),Wk,Vi), by definition 
Dk+iiVi) < Dk+iiPkiWk))+f iPkiWk), Wk, Vi), by Lemma 4, 
replacing w„ by vy: 

Dk+iivi) < Dk+iiPkivj)) +f(Pk(vj),V),Vi). 

□ 



Endnotes 

^Available in 

http://grass.osgeo.org/sampledata/north_carolina/ 

''Available in 
http://www.cs.fsu.edu/~lifeifei/SpatialDataset.htm 
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